arXiv:1505.04045vl [q-fin.PM] 15 May 2015 


The final publications is now available at doi:10.1016/j.jenipfin.2015.03.003 


Portfolio optimization for heavy-tailed assets: 
Extreme Risk Index vs. Markowitz 

Georg Mainik * Georgi Mitov ^ Ludger Riischendorf ^ 

May 7, 2015 


Abstract 

Using daily returns of the S&P 500 stocks from 2001 to 2011, we 
perform a backtesting study of the portfolio optimization strategy 
based on the extreme risk index (ERI). This method uses multivari¬ 
ate extreme value theory to minimize the probability of large portfolio 
losses. With more than 400 stocks to choose from, our study seems 
to be the first application of extreme value techniques in portfolio 
management on a large scale. The primary aim of our investigation 
is the potential of ERI in practice. The performance of this strat¬ 
egy is benchmarked against the minimum variance portfolio and the 
equally weighted portfolio. These fundamental strategies are impor¬ 
tant benchmarks for large-scale applications. Our comparison includes 
annualized portfolio returns, maximal drawdowns, transaction costs, 
portfolio concentration, and asset diversity in the portfolio. In addi¬ 
tion to that we study the impact of an alternative tail index estimator. 

Our results show that the ERI strategy significantly outperforms both 
the minimum-variance portfolio and the equally weighted portfolio on 
assets with heavy tails. 
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1 Introduction 

In this paper we propose and test a portfolio optimization strategy that aims 
to improve the portfolio return by stabilizing the portfolio value. Minimiz¬ 
ing the probability of large drawdowns, this strategy can help to retrieve 
the portfolio value as good as possible also in times of high risk in the mar¬ 
kets. This intended performance is, of course, not a new aim in portfolio 
management, and it became even more vital since the default of Lehman 
Brothers in 2008. The following years of financial crisis have demonstrated 
that the technical progress of financial markets and their globalization have 
also brought up some new challenges. One of these challenges is the need for 
diversification strategies that account for strong drawdowns and increasing 
dependence of asset returns in crisis periods. This has raised the relevance 
of non-Gaussian models, tail dependence, and quantile based risk measures 
in portfolio optimization [SI El El EDI EH E2l EH EH EH ESI EH EH EH m] ■ 

Developments in theory and practice of portfolio opti¬ 
mization 

Since its introduction by Markowitz [SS]) fhe mean-variance approach became 
the industry standard for asset allocation. However, this popularity also 
brought up several technical issues in practical applications, and there has 
been a large amount of further development addressing them. 

One main direction of related research is dedicated to the impact of pa¬ 
rameter uncertainty on the investment performance. The high sensitivity 
of the estimated mean-variance efficient portfolio to estimation errors in the 
underlying distribution parameters (expectations and covariances of asset 
returns) may lead to highly non-robust results. Barry [2] and Chopra and 
Ziemba |S] show the high sensitivity in particular when estimating the ex¬ 
pected returns. Jorion [SHEHIEI] and Jagannathan and Ma [2^ find that the 
pure minimum variance (MV) portfolio may outperform the mean-variance 
efficient portfolio. 

Several approaches addressing the statistical challenge of parameter un¬ 
certainty have been suggested in the literature. These include the use of 
Bayesian and shrinkage estimators, shrinking the portfolios to some predeter¬ 
mined target which depends on combination of prior information with sample 
data (see, e.g., Jorion [221 ED])- Black and Litterman [3] suggest Bayes esti¬ 
mation of means and covariances. However, their findings on the superiority 
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of the Bayes/Stein procedure are not confirmed in some other studies like 
Fletcher and Leyffer [12] and Grauer and Hakansson [22] ■ DeMiguel et ah [2| 
and DeMiguel and Nogales [S] investigate the potential advantage of robust 
optimization and shrinkage estimators. The resulting picture is, however, 
not completely clear, and it turns out that even robustified and optimized 
procedures in some cases fail to outperform simple heuristic strategies like 
the equally weighted portfolio. 

Concerning robust asset allocation, Tiituncu and Koenig [12| look for ro¬ 
bust solutions that have the optimal worst-case performance, whereas Gold- 
farb and Iyengar ng choose worst-case estimators in a robust model frame¬ 
work that can be solved by linear programming. Herold and Maurer [22] 
observe that even these more stable estimation methods only outperform 
simple strategies when combined with regression models for the expected 
return. 

Another research direction includes several approaches to change the ob¬ 
jective function in the optimization problem underlying the investment strat¬ 
egy. One of the issues addressed here is that quantification of risk by variance 
does not distinguish between gains and losses. Hence, to avoid wrong con¬ 
clusions for asymmetrically distributed returns, application of pure downside 
risk measures is advantageous. Young [33] introduces an alternative opti¬ 
mization criterion based on minimum return instead of variance as measure 
of risk, and proposes a minimax approach. This corresponds to a utility 
principle with an extreme form of risk aversion on investor’s side. Ghaoui 
et al. [13] propose a worst-case Value-at-Risk and robustified programming 
approach based on only partial information about the return distributions, 
assuming that only bounds on the moments are known. Jarrow and Zhao 
[23] apply lower partial moments as risk measure for downside loss aversion 
and compare the resulting optimal portfolios with the mean-variance based 
ones. While both methods perform similarly on normally distributed returns, 
they can lead to significantly different results on returns with asymmetric, 
heavy-tailed distributions. 

Portfolio optimization based on the Extreme Risk Index 
(ERI) 

In our paper we follow the basic line of developments on the optimization 
problem that the investment strategy is derived from. Our reformulation of 
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the objective function in this optimization problem is based on extreme value 
theory, and it is specifically designed for portfolios with heavy-tailed assets. 
Extreme value theory is an adequate tool to improve the modelling of return 
tails. 

In contrast to the mean-variance optimization, our approach does not rely 
on existence of second moments for the return distribution. With increas¬ 
ingly heavy tails, variance and covariance estimators can become unreliable, 
or even the moment themselves may fail to exist. Thus the mean-variance ap¬ 
proach tends to face its limitations especially in crisis periods, when financial 
returns behave in their most extreme way. Several modifications addressing 
this issue have been discussed; see, e.g., Rachev et al. [HH] for the relevance 
of this type of heavy-tailed models. 

In the present study we apply a novel method based on extreme value the¬ 
ory to a portfolio optimization on real data. This study seems to be the first 
attempt in extreme-value based portfolio optimization on large scale. Our 
primary aim is to assess the general potential of extreme-value based methods 
in portfolio optimization. At this initial stage, we compare a very basic imple¬ 
mentation of our extreme-value approach with similarly basic and therefore 
relatively robust benchmarks. Our benchmarks are given by the minimum- 
variance portfolio (MV) and the equally weighed portfolio (EW), which in¬ 
vests the 1/N fraction of the total capital in each of N assets. According 
to our results, the extreme-value based method stays behind its benchmarks 
on assets with light tails, but outperforms each of them (MV and EW) on 
assets with moderately heavy or very heavy tails. As discussed above, outper¬ 
forming these simple methods on large scale is non-trivial even with refined 
estimation techniques. The advantage of the extreme-value based method is 
particularly strong in the case of heaviest tails, which the method is designed 
for. 

More specifically, the mathematical basis of our approach is laid out in 
Mainik and Riischendorf [55]. Our portfolio is obtained by minimizing the 
Extreme Risk Index (ERI), which quantifies the impact of heavy, dependent 
tails of asset returns on the tail of the portfolio return. We apply this strategy 
and the chosen benchmarks to the daily return data of the S&P 500 stocks 
in the period from November 2007 to September 2011. The computation of 
portfolio weights utilize the data from the six years prior to each trading 
day. To assess the impact of delays in portfolio rebalancing, we implement 
rebalancing not only on daily, but also on weekly basis. For the sake of 
stability, the portfolio estimates for both daily and weekly rebalancing are 
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based on daily data. In addition to the portfolio value we also track some 
other characteristics related to portfolio structure, degree of diversification, 
and transaction costs. 

In the first round of our backtesting experiments we apply ERI optimiza¬ 
tion to all S&P 500 stocks with full history in our data set (444 out of 500). In 
this basic setting the ERI based algorithm slightly outperforms the MV and 
EW portfolios with respect to annualized returns (6.8% vs. 5.8% and 5.3% 
for daily rebalancing). All methods significantly outperform the S&P500 
index, which has the annualized return of —5.2%. 

As next step we subdivide the stocks into three groups according to their 
tail characteristics. Our results show that ERI optimization is particularly 
useful for assets with heavy tails. On this asset group it clearly outperforms 
Markowitz and yields an annualized return of 11.5% for daily rebalancing. 
This is impressive compared to the 5.0% and 5.1% achieved with the MV and 
EW strategies, and even more so because the backtesting period includes the 
recent financial crisis. Tracking the portfolio turnover, we found that the 
ERI strategy tends to increase the transaction costs. However, the turnover 
of the ERI optimal portfolio for the group with heavy tails is lower than 
the turnover of the MV portfolio in the basic experiment without grouping. 
The performance of the EW portfolio is similar to that of the MV portfolio, 
especially on assets with heavy-tailed returns. 

Our major finding is that the ERI optimization significantly outperforms 
MV and EW portfolios for assets with very heavy tails. Furthermore, the 
structure of the ERI optimal portfolio is very different from its peers, espe¬ 
cially in the basic case with portfolio selection from all 444 assets considered. 
The ERI based portfolio is build from fewer assets, but nevertheless it shows 
better diversification as measured by principal component analysis. The 
overall picture for weekly rebalancing is similar. These results suggest that 
ERI optimization can be a useful alternative for portfolio selection in risky 
asset classes. In some sense, this strategy seems to earn the reward that the 
economic theory promises for the higher risk of heavier tails. 

A remarkable detail in this study is that none of the three compared 
methods (ERI, MV, EW) looks at expected returns. Nevertheless, each of 
them significantly outperforms the S&P 500 index, and the annualized return 
of the ERI strategy on heavy-tailed assets is surprisingly high if we keep in 
mind that the data we used includes the financial crisis of 2008 and 2009. 
The risk-orientated nature of the ERI strategy suggests that this result is 
due to improved detection and handling of risk in the portfolio. 
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Further improvement of the ERI-based portfolio optimization by incor¬ 
porating expected returns is analogous to the mean-variance setting. It can 
be done by adjusting the ERI-based optimization problem by a linear con¬ 
straint that reflects some target return. Theoretically, this should improve 
the performance of the ERI strategy even further. However, practical im¬ 
plementation of this extension faces same statistical challenges as for the 
Markowitz strategy with a target return. The literature discussed above 
suggests that outperforming the purely risk-orientated version of the ERI 
strategy would be non-trivial. 

The paper is organized as follows. The alternative portfolio optimization 
algorithm and its technical backgrounds are introduced in Section In 
Section]^ we give an outline of the data used in the backtesting study, define 
the estimator for the optimal portfolio, and introduce all additional portfolio 
characteristics to be tracked. Detailed results of the backtesting experiments 
are presented and discussed in Section Conclusions are given in Section 

2 Theoretical backgrounds 

2.1 Asset and portfolio losses 

Let Si(t) denote prices of assets Si, i = 1,... ,N, at times t = 0,1,... ,T. 
Focusing on the downside risk, let Xi{t) denote the logarithmic losses of the 
assets Si, 



( 2 . 1 ) 


and let Xi(t) denote the corresponding relative losses: 

~ _ S,{t - 1) - S,{t) _ S,(t - 1) 

^ ■ s,{t) S,{t) 


For daily stock returns, X* and Xi are almost identical because Xi is the 
first-order Taylor approximation to the logarithmic loss Xi. 

This approximation also extends to asset portfolios. Consider an invest¬ 
ment strategy (static or one-period) diversifying a unit capital over the assets 
Si,..., Sn. It can be represented by a vector w of portfolio weights, w G 
Hi := {x G : Y^^i^i = !}• Excluding short positions, the portfolio set 
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can be restricted to the unit simplex := {w E [0,1]'^ : Wi = 1}. This 

is the portfolio set we will work with from now on. Each component w* > 0 
corresponds to the fraction of the total capital invested in Si, and the relative 
portfolio loss is equal to the scalar product w^X{t) := of the 

portfolio vector w and the relative loss vector X(t) = (Xi(f ),..., X^it)): 

^ w- ~ 

E s;(Xt) 

Thus the scalar product w^X{t) for the logarithmic loss vector X{t) := 
(Xi(t), ...,Xisf(t)) is the first-order Taylor approximation to w'^X. This 
kind of approximation is also relevant to the Markowitz approach, which is 
typically applied to logarithmic returns. 


2.2 Multivariate regular variation 

To define the Extreme Risk Index (ERI) of the random vector X{t), we 
recollect the notion of multivariate regular variation (MRV). A random vector 
X = (Xi,... jXjv) is MRV if the joint distribution of its polar coordinates 
R ■= ||X||i := \Xi\ and Z := ||X||r^X satisfies 

B{{r~^R,Z)\R>r)^Pa<^^, r —)■ cxo, (2.3) 


where T is a probability measure on the 1-norm unit sphere Sp and pa is 
the Pareto distribution: pa{s,oo) = s > 1. The symbol ^ in (2.3) 
represents the weak convergence of probability measures, and the symbol (8) 
refers to the direct product of probability measures. The intuitive meaning 
of (2.3) is that, conditioned on i? > r for a sufficiently large r, the random 
variable r~^R is approximately Pareto(Q;) distributed and independent of Z, 
which is approximately T-distributed. 

Besides (2.3), there are several other equivalent definitions of MRV; for 
more details we refer to Resnick The parameter a > 0 is called tail 

index. It separates finite moments of R from infinite ones in the sense that 
ER^ < oo for (5 < a and ER^ = oo ioi (3 > a. In the non-degenerate case, 
the same moment explosion occurs for all components Xj of the random 
vector X. The measure T is called spectral (or angular) measure of X and 
describes the asymptotic distribution of excess directions for the random 
vector X. 
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Intuitively speaking, MRV means that the radius R has a polynomial tail 
and is asymptotically (i.e., for large R) independent of the angular part Z. 
Moreover, if a measurable set A C is sufficiently far away from the origin, 
i.e., if ||x||i > t for all a; G A with some large t, then 


P{X e sA) ~ s-^P{X e A) 


(2.4) 


for s > 1 and sA := {sx : x G A}. The scaling property (2.4) allows to 


extrapolate from large losses to extremely large ones, which even may be 
beyond the range of the observed data. Approximations of this kind are the 
key idea of the Extreme Value Theory (cf. Embrechts et ah [TH]). 

Many popular models are MRV. In particular, this is the case for mul¬ 
tivariate t and multivariate a-stable distributions (cf. Araujo and Gine 
PP, Hult and Lindskog |2S])- In the latter case, the stability index a is 
also the tail index, and the spectral measure characterizing the multivariate 


stability property is a constant multiple of T from (2.3). In all these models. 


the components Xj are tail equivalent in the sense that P{Xi > r)/P{Xj > 
r) —>■ Qj > 0 as r —)■ cx) for all i,j G {!,..., N}. This is equivalent to the 
following non-degeneracy condition for the angular measure T: 


Tlx G : Xj = 0} < 1 


for z = 1,..., X. 

It should be noted that the MRV assumption (2.3) is of asymptotic na¬ 
ture and that it is also quite restrictive. MRV models are often criticized 
for excluding even slightly different tail indices Oj for the components Xj. 
However, this criticism also affects the multivariate t and multivariate a- 
stable models, which are widely accepted in practice despite the resulting 
restriction to equal at. It is indeed true that, estimating the tail index a* for 
each component Xj separately, one would hardly ever obtain identical values 
for different i. But on the other hand, the confidence intervals for Oj often 
overlap, so that a MRV model may be close enough to reality and provide a 
useful result. 

The major reason why MRV models can be useful in practice is that 
the practical questions are non-asymptotic. In fact, it is not the restrictive 
asymptotic relation (2.3) that matters, but the scaling property (2.4). If (2.4) 
is sufficiently close to reality in the range that is relevant to the application, 
the eventual violation of (2.3) further out in the tails does not influence the 
result too much. 
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Practical applications often involve heuristics of this kind. In particular, 
if Si are stock prices and hence non-negative, then the relative losses X* are 
bounded by 1. Going sufficiently far out into the tail, one must observe quite 
different behaviours for the relative portfolio loss w'^X and the logarithmic 
approximation nFX. However, with typical daily return values in the low 
percentage area and values around 10% occurring only in crisis times, relative 
asset losses do exhibit polynomial scaling of the type 


P{X^>rs) 

P{X, > r) 


(2.5) 


This is illustrated in Figure [2Tt which shows QQ-plots of logarithmic S&P 500 
returns (same observation period as in our backtesting study) versus the nor¬ 
mal and the Student-t(3) distribution. The normal distribution is light-tailed, 
whereas the t{3) distribution satisfies (2.5) with a = 3 (more generally, a t 
distribution with u degrees of freedom satisfies (2.5) with a = v). The 
dashed, red lines mark the 0.4%, 10%, 90%, and 99.6% quantiles of the dis¬ 
tributions on the X axes (normal or f(3)). The area between the 0.4% and 
10% quantiles corresponds to the worst returns observed every 2 weeks (10 
business days) or once a year (about 250 business days). The area between 
the 90% and 99.6% quantiles corresponds to the best returns observed every 2 
weeks or once a year. This is the application range mentioned above. A good 
distributional fit makes the QQ-plot linear in this range. Figure 2.1 demon¬ 
strates clearly that the normal distribution gives a poor fit to the S&P 500 
return data, whereas the heavy-tailed t distribution fits much better. This 
picture depends neither on whether one takes the index or single stocks, nor 
on the observation period. Figure [2T] uses the same observation range as our 
backtesting study, but even shifting the observation window 10 or 20 years 
back into the past gives astonishingly similar results. 

Hence we are lucky to remain in the area where X and X can be treated 
as if they both were MRV, and the approximation w'^X ~ w^X works rea¬ 
sonably well. Thus, even though the scaling property (2.5) eventually breaks 
down if rs gets too close to 1, it has some useful consequences in the appli¬ 
cation range. This is confirmed by our backtesting results. 
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normal quantiles 



Figure 2.1 QQ-plots for the logarithmic returns of the S&P500 index vs. normal and 
t(3) distribution. The dashed, red lines mark the 0.4%, 10%, 90%, and 99.6% quantiles. 


2.3 Portfolio optimization via Extreme Risk Index 


The MRV assumption (2.3) implies that 


P{w'^X > r) 
™F(||X||i >r) 





zY d^{z) 


( 2 . 6 ) 


(cf. ESI and E31 Lemma 2.2). This implies that for any portfolio vectors 
v,w E and large r > 0 


F(n^X > r) ^ 7^ 
P{w^X > r) - 

Moreover, for A < 1 close to 1 one obtains that 

VaRi(«)’'A') ~ 


(2.7) 


( 2 . 8 ) 


(cf. ES] and ESI Corollary 2.3). Here and in what follows we define the Value- 
at-Risk VuRa of a random loss X at confidence level A as the A-quantile of 
X: 

VaRA(X) := inf{x G M : F(X < x) > A}. 

Roughly speaking, VaRA(X) is the smallest x such that X < x holds with 
probability A. Typical values of A are 95%, 99%, and 99.5%. 
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Motivated by ( 2.7| ) and (2.8), the functional 7 ^ = 7 ^(\l',a) is called Ex¬ 
treme Risk Index (ERI). Minimizing the function w 1 —)■ 7 ^, one obtains a 
portfolio that minimizes the loss for large ||27||, i.e., in case of crisis events. 
In precise mathematical terms, one minimizes VaRA(w^X) for A —?■ 1. The 
practical meaning of this procedure is the utilization of the scaling prop¬ 


erty (2.4) to obtain a portfolio that minimizes the downside risk during a 


market crash. This approximate result is not perfect, but it can be a step 
into the right direction. 


Based on the integral representation (2.6), the following portfolio opti¬ 


mization approach was proposed in Mainik and Riischendorf 


Estimate 7 ^ by plugging appropriate estimates for a and T into (2.6); 


Estimate the optimal portfolio by minimizing the resulting estimator 
with respect to w. 


The general properties of the optimization problem are discussed in Mainik 
and Riischendorf [3^, Mainik [32], and Mainik and Embrechts [31|. In par¬ 
ticular, it is known that the function tc 1 —)■ 7 .^, is convex for a > 1. Thus, 
given that the expectations of Xj are finite, a typical optimal portfolio would 
diversify over multiple assets. The consistency of the plug-in estimator 7 ^ 
and of the resulting estimated optimal portfolio w* in a strict theoretical 
sense is studied in Mainik [3311331 , Mainik and Riischendorf [33] • 


3 Outline of the backtesting study 

3.1 The data 


The contribution of the present paper is a backtesting study of the ERI based 
portfolio optimization approach on real market data. Our data set comprises 
all constituents of the S&P 500 market index that have a full history for 
the period of 10 years back from 19-Oct-2011. These are 444 stocks out of 
500. For each date of the backtest period 19-0ct-2007 to 19-Oct-2011 the 
estimation of the optimal portfolio is based on the 1500 foregoing observations 
- approximately 6 years of history - for all stocks back in time. For example, 
the optimal portfolio for 19-0ct-2007 is estimated from the stock price data 
for the period (19-0ct-2001 to 18-0ct-2007). 

Our computations are based on the logarithmic losses Xi(t) as defined 
in ( 2 . 1 ). As already mentioned above, we exclude short positions. This 
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basic framework is most natural for the comparison of portfolio strategies. 
The asset index i varies between 1 and N = 444, and the time index t 
takes values between 1 and T = 2509 (1500 days history + 1009 days in the 
backtest period). To estimate a and T, we transform the (logarithmic) loss 
vectors X{t) into polar coordinates 

Z{t)) = (||X(f)||i, ||X(t)||r'X(f)), f = 1,... ,T. 

3.2 The estimators and the algorithms 

We estimate a by applying the Hill estimator to the radial parts R{t): 


LX ^ j -- I O. J- I 

Ej=i log(i?(j),t/ R{k+I),t) 

where t > 1500 and R(i),t > • • • > R{i 5 oo),t is the descending order statistic 
of the radial parts R{t — 1500),. .., R{t — 1) and k = 150. That is, out of the 
1500 data points in the historical observation window t—1500,..., t—1 we use 
the 10% with largest radial parts. Going back to Hill [2S], the Hill estimator 
is the most prototypical approach for the estimation of the tail index a. The 
choice of k determines which observations are assumed to describe the tail 
behaviour. Another important criterion for the choice of k is the trade-off 
between the bias, which typically increases for large fc, and the variance of the 
estimator, which increases for small k. In addition to the static 10%-rule we 
also consider the adaptive approach proposed in Nguyen and Samorodnitsky 
127!. See Danielsson et ah |7], Drees and Kaufmann m, Resnick and Starica 
jH] for further related methods. 

As proposed in Mainik and Riischendorf [35], we estimate 4/ by the em¬ 
pirical measure of the angular parts from observations with largest radial 
parts. More specifically, we use the same 10% data points (the so-called tail 
fraction) in the moving observation window that were used to obtain a. The 
resulting estimator 7 ^ is 


1 ^ 

%it) = -r 

where ij^t is the sample index of the order statistic R{j),t in the full data set: 
= Rii,,t), J = 1,..., 1500, t = 1501,... ,T. 
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The resulting estimate of the optimal portfolio w*{t) on the trading day 
t is the portfolio vector w G that minimizes 7 «,(t): 


= min 7 ^(f). 


(3.2) 


Finally, the estimated optimal portfolio w*{t) is used to compose the 
portfolio for the trading day t. The resulting (relative) portfolio return is 


calculated by substituting w*{t) in (2.2). 

The procedure outlined above is repeated for all trading days t > 1500. 
For instance, the optimal portfolio for 22-Oct-2007 is based on the observa¬ 
tion window from 29-Oct-2001 to 21-0ct-2007, whereas for 23-Oct-2007 we 
use the observation window from 30-0ct-2001 to 22-Oct-2007, and so on. 

The benchmarks for this portfolio optimization algorithm are given by 
the equally weighted (EW) portfolio assigning the weight of 1/iV = 1/444 
to each asset and the the minimum variance (MV) portfolio. Analogously 
to the ERI optimal portfolio, the MV portfolio is calculated from logarith¬ 
mic asset returns, with the same moving observation window of 1500 points 
and empirical estimators for the covariance matrix. Similarly to the ERI ap¬ 
proach, our implementation of the Markowitz approach chooses the portfolio 
with minimal risk, i.e. with minimal variance. That is, given an estimator 
C of the covariance matrix of the asset returns Si,..., Sn, the estimated 
minimum variance portfolio tCMV is obtained by minimizing the function 

w H->■ vj^Cw 


for w G A^. That is, we do not include an additional linear constraint 


uFp, = 


(3.3) 


with an estimator p of the daily return and a target return fl > 0. 

There are two reasons for this choice. On the one hand, ERI minimiza¬ 
tion is also a pure risk minimization procedure, so that ignoring estimates of 
the expected returns in the Markowitz benchmark increases the fairness of 
competition. Furthermore, since the ERI approach only changes the quan¬ 
tification of risk and does not yet change the view on gains, it is natural 
to study its effect in a purely risk orientated setting. Endowment of the 
ERI approach with a target return is straightforward. Analogously to the 


Markowitz approach, it suffices to add the linear constraint (3.3) to the op 


timization problem (3.2). 
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On the other hand, computation of a Markowitz efficient portfolio with 


target return constraint (3.3) would require estimation of expected asset 


returns and bring in all the technical issues discussed in Section The 
same issues must appear in an extended ERI application that includes (3.3). 


In practice, these technical issues can dominate the theoretical performance 
improvement associated with a target return. 


4 Empirical results 

4.1 Basic setting for entire set of stocks 

We start with the most crude application of the ERI minimization strategy, 
estimating the tail index from the radial parts of the random vector (Xi,..., 
X 444 ) of all stock retrains involved in our study. The resulting estimate 
a = a{t) varies in time, but it is applied to all N = 444 stocks as if their 
joint distribution were MRV. This is a very courageous assumption, but 
even in this case we see some useful results. A first impression of these 


results is given in Figure 4T, where the value of the ERI optimal portfolio is 
compared to the performance of its peers (MV and EW) and to the S&P 500 
index. The graphic suggests that the value of the ERI based portfolio is more 
stable during market crashes. On the other hand, the MV portfolio seems 
to catch up again during recovery periods. Markowitz approach also tries to 
assess potential gains. The cumulative returns achieved with ERI, MV, and 
EW, are similar in this setting, but still with some advantage for the ERI 
based portfolio. Thus it seems that the ERI strategy - even in its crudest 
implementation - has a potential to stabilize the portfolio value in crises. 
The overall performance of the EW portfolio is very similar to that of the 
MV portfolio, but with lower Sharpe ratio and higher drawdowns. Thus it 
suffices to consider the MV benchmark in the present setting. 

All actively traded strategies benchmarks (ERI, MV, EW) clearly outper¬ 
form the S&P 500 index. The real dimension of this advantage is, however, 
not obvious, because for simplicity of implementation we apply ERI, MV, and 
EW only to the 444 stocks that remain in the S&P 500 through the whole 
observation period. Since this information about future developments is not 
available in reality, there may be a survivor bias increasing the performance 
of the three actively traded benchmarks. However, the resulting comparison 
between ERI, MV, and EW should be fair because each of them is applied 
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Portfolio Optimization Backtest - Daily Rebalancing 



Figure 4.1 Portfolio optimization backtest for the ERI minimization strategy under the 
assumption that all all stock returns have the same tail index a. The resulting portfolio 
value of the ERI strategy and its peers (MV, EW, and S&P 500) is scaled to 100 for the 
first date of the backtest period. 


to the 444 survivor stocks. 

Further characteristics of the basic ERI approach compared to its peers 
are shown in Table 4T The numbers show that the ERI strategy indeed 
outperforms MV and EW portfolios in many respects. In particular, the 
ERI optimal portfolio gives higher cumulative returns and a higher Sharpe 
ratio, whereas the maximal drawdown is lower than with the MV strategy. 
An extension of the Sharpe ratio based on the Expected Shortfall (ES) is the 
STARR ratio (cf. Rachev et ah 


where Vf is the risk-free interest rate and A is a confidence level close to 1. 
The backtested STARR is also higher for the ERI strategy than for the MV 
approach. The computation of the Sharpe and STARR ratios is based on 
empirical estimators for the expectation and for the Expected Shortfall. In 
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ERI 

MV 

EW 

S&P 500 

CR (Cumulative Return) 

30.07% 

25.48% 

23.24% 

-19.38% 

AR (Annualized Return) 

6.76% 

5.81% 

5.34% 

-5.22% 

AS (Annualized Sharpe) 

0.4715 

0.3469 

0.3229 

-0.0462 

AST (Annualized STARR 0 . 95 ) 

0.1926 

0.1410 

0.1318 

-0.0187 

MD (Max Drawdown) 

46.61% 

58.61% 

63.27% 

56.34% 

AC (Average Concentration Coefficient) 

8.69 

127.22 

444 

N/A 

AT (Average Turnover) 

0.0400 

0.0272 

0.01 

N/A 

PCA (First PCA factor Explained Variance) 

31.32% 

35.48% 

38.80% 

N/A 


Table 4.1 Backtest statistics for the ERI minimization strategy in the basic setting (ap¬ 
plied to all stocks at once) vs. Minimum Variance (MV), Equally weighted portfolio (EW), 
and S&P 500. 


particular, the estimate of ES 0.95 over the backtesting period of 1009 days is 
based on 51 largest observations of the portfolio loss. Since a risk-free rate on 
a daily scale is both difficult to determine and negligibly small, we set rj = 0 . 


The annualized Sharpe and STARR ratios reported in Table 4T and all other 
tables across the paper are obtained from daily ratios by multiplying them 
with the factor a/252. This heuristic approach is based on the assumption 
hat the calendar year has T = 252 business days and the returns scale over 
time with factor T, whereas the yearly volatility and Expected Shortfall scale 
with factor \/T. The resulting annualized values are very rough approxima¬ 
tions, but with 10 years of data, more reliable estimation of yearly returns, 
volatilities, and Expected Shortfall is not feasible. 

To measure the portfolio stock concentration, we compute the Concen¬ 
tration Coefficient (CC). It is defined as 


cc(«) - !:«.?(«) 


( 4 . 1 ) 


k 2 = l 


where Wi{t) is the relative weight of the asset i in the investment portfolio at 
time t. Conceptually, this approach is well known in measures of industrial 
concentration, where it is called as the Herfindahl-Hirschman index (HHI). 
Brandes Institute introduced the concentration coefficient by inverting the 


16 


@2015, Elsevier. Licensed under the Creative Commons 
Attribution-NonCommercial-NoDerivatives 4-0 International 
http://Great ivecommons. org/ licenses/hy-nc-nd/ 4- 0/ 





















The final publications is now available at doi:10.1016/j.jenipfin.2015.03.003 


HHI. 

The CC of an equally weighted portfolio is identical with the number 
of assets. As the portfolio becomes concentrated on fewer assets, the CC 
decreases proportionally. The numbers in Table |4.1| indicate that the ERI 


strategy is quite selective, whereas the number of stocks in the MV portfolio 
is on the same scale with the total number of assets. 

To assess the level of diversification provided by each optimization algo¬ 
rithm, we performed Principal Component Analysis (PCA) over the returns 
of all stocks relevant to the corresponding portfolios. We defined relevance 
via portfolio weights assigned by the algorithms and restricted PCA to the 
stocks with portfolio weights higher than 0.01%. Then we estimated the por¬ 
tion of the sample variance explained by the first PCA factor and averaged 
these daily estimates over the backtesting period. The lower the average 
portion of sample variance explained by the first PCA factor, the higher is 


the portfolio diversification. The numbers in Table 4T are quite surprising: 
despite the significantly higher concentration, the diversification level of the 
ERI based portfolio is higher than that of the MV strategy. 

The only performance characteristic where ERI stays behind MV is the 
portfolio turnover, which is a proxy to the transaction costs of a strategy. 
We use a definition of portfolio turnover that is based on the absolute values 
of the rebalancing trades: 

n 

i=l 

where Wi{t) is the (relative) portfolio weight of the asset i after rebalancing 
(according to the optimization strategy) at time t, and Wi(t_) is the portfolio 
weight of the asset i before rebalancing at time t, i.e., at the end of the trading 
period t — 1. Averages of r(f) over all t in the backtesting period are given 
The average turnover of the ERI optimal portfolio (0.0400) is 


in Table 4.1 


higher than that of the minimum variance portfolio (0.0272). 

Some technical details. For the calculation of the portfolio value we use 
relative returns and do not expect much difference when using logarithmic 
approximations. In the calculation of STARR and Sharpe ratio we do not 
use risk free rates since these are very small on a daily basis and thus have 
little influence on the the ratio calculations. For the estimation of ES in 
STARR we use the average of all sample values smaller than the 95% VaR of 
the sample. Our backtest period is of length 1009 and thus the ES estimate 
is based on n = 51 observations. 
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4.2 Grouping the stocks with similar ot 

In the previous section we treated all stocks as if their (logarithmic) returns 
Xj had the same tail index a. This simplification can influence the quan¬ 
titative and qualitative results. To obtain a better insight, we divide the 
stocks into three different groups with respect to their individual a and com¬ 
pare the performance of the portfolio optimization strategies on each of these 
groups. Figure [4^ shows the histogram of the estimates of the tail index a 
for different stocks on the first day of the backtesting period {t = 1501). 



Figure 4.2 Estimated values of the tail index a for different stocks on the first day of the 
backtesting period 

We consider the following groups: 

1 . all stocks with a < 2.2 

2 . all stocks with a G (2.2, 2.6) 

3. all stocks with a > 2.6 

The first group contains 134, the second 243, and the third one 67 stocks. 
These groups remained static during the backtesting period. That is, the 
estimated a on the first day of the backtesting period determines in which 
group each stock is placed. 
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Portfolio Optimization Backtest - Daily Rebalancing 



Figure 4.3 Portfolio optimization back¬ 
test. Stocks with a < 2.2 



ERI 

MV 

EW 

S&P 500 

CR 

54.70% 

21.58% 

22.30% 

-19.38% 

AR 

11.48% 

4.99% 

5.14% 

-5.22% 

AS 

0.6623 

0.3546 

0.3182 

-0.0462 

AST 

0.2695 

0.1430 

0.1299 

-0.0187 

MD 

53.67% 

48.03% 

61.32% 

56.34% 

AC 

7.4499 

10.9712 

134 

N/A 

AT 

0.0269 

0.0154 

0.01 

N/A 

PCA 

35.02% 

33.33% 

35.17% 

N/A 


Table 4.2 Backtest statistics. Stocks with 
a < 2.2 


Selection from the set of stocks with a < 2.2 


The backtesting results on stocks with tail index a < 2.2 are summarized in 
Figure |4]3 and Table 4^ In this case ERI minimization clearly outperforms 
its peers and yields an impressive annualized return of 11.48%. This is more 
than the double of roughly 5% achieved with the MV or with the EW port¬ 
folio. The overall performance of the EW portfolio is again similar to that of 
the MV portfolio, but with greater drawdowns. Thus it suffices to consider 
the MV benchmark in this case. 

The Sharpe and STARR ratios of the ERI strategy are also clearly higher 
than with MV. The concentration of both portfolios is on the same scale, but 
still a bit higher for the ERI based one. Similarly to the basic backtesting 
set-up on all S&P 500 stocks, the ERI strategy produces a higher portfolio 
turnover (0.0269 vs. 0.0154 with MV). However, both values are lower than 
the average turnover of the MV portfolio in the basic setting (0.0272). 

These results suggest that the ERI strategy is particularly useful for op¬ 
timizing portfolios of stocks with heavy tails, in our case of 134 out of 444 
stocks. This is to be expected since the ERI methodology was developed 
for heavy-tailed MRV models. Beyond that, there is also a statistical reason 
for the inferior performance of the MV approach in the present setting. Es¬ 
timation of covariances becomes increasingly difficult for heavier tails, and 
for a < 2 the covariances (and hence correlations) do not even exist. Thus 
empirical covariances used in the Markowitz approach can push the investor 
into the wrong direction. 
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Selection from the set of stocks with a G (2.2, 2.6) 


Portfolio Optimization Backtest - Daily Rebalancing 



Figure 4.4 Portfolio optimization back¬ 
test. Stocks with a S (2.2, 2.6) 



ERI 

MV 

EW 

S&P 500 

CR 

35.87% 

31.00% 

22.28% 

-19.38% 

AR 

7.93% 

6.96% 

5.14% 

5.22% 

AS 

0.5448 

0.3711 

0.3170 

-0.0662 

AST 

0.2306 

0.1517 

0.1301 

-0.0187 

MD 

45.56% 

57.70% 

63.89% 

56.34% 

AC 

7.3987 

1.00 

243 

N/A 

AT 

0.0249 

0.0000 

0.01 

N/A 

PCA 

32.78% 

ioo.6o% 

40.24% 

n7a 


Table 4.3 Backtest statistics. Stocks with 
a G (2.2, 2.6) 


If the stock selection is restricted to those with a between 2.2 and 2.6, the 
annualized return of the ERI based portfolio (7.93%) is somewhat above the 
MV and EW benchmarks (6.96% and 5.14%, respectively). While the returns 
are on the same scale, the volatilities of the MV and the EW portfolios rare 
much higher. Thus ERI optimization clearly outperforms its peers in terms of 
Sharpe ratio, STARR (both higher for ERI), and maximal drawdown (lower 
for ERI). It is somewhat astonishing that the PCA of the MV portfolio is 
100 %, i.e. the minimum variance algorithm selects only one stock. 


Selection from the set of stocks with a > 2.6 

For stocks with a > 2.6 (and hence lightest tails), the performance of the 
ERI minimization strategy stays behind MV and EW in terms of annualized 
return, Sharpe ratio, STARR, and turnover. The maximal drawdown is 
similar for ERI and MV, and higher for the EW portfolio. The diversification 
level in terms of PCA is similar for all three competing strategies. The 
portfolio concentrations resulting from the ERI and the MV approaches are 
on the same level, and slightly higher for the ERI optimal portfolio. 

Thus the impressive advantage of the ERI minimization strategy seems to 
be restricted to stocks with pronounced heavy-tail behaviour. This advantage 
turns into near parity for stocks with moderately heavy tails. For light-tailed 
stocks the MV strategy yields higher annualized returns with a similar draw¬ 
down, and the EW portfolio even higher returns, but also a significantly 
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Figure 4.5 Portfolio optimization back¬ 
test. Stocks with a > 2.6 



ERI 

MV 

EW 

S&P 500 

CR 

3.47% 

13.62% 

25.27% 

-19.38% 

AR 

0.85% 

3.23% 

5.77% 

-5.22% 

AS 

0.1397 

0.2636 

0.3367 

-0.0462 

AST 

0.0581 

0.1114 

0.1382 

-0.0187 

MD 

42.58% 

43.43% 

64.89% 

56.34% 

AC 

3.4817 

4.4715 

67 

N/A 

AT 

0.0165 

0.0091 

0.01 

N/A 

PCA 

54.92% 

52.43% 

47.52% 

N/A 


Table 4.4 Backtest statistics, a > 2.6 


higher maximal drawdown. These findings perfectly accord with model as¬ 
sumptions underlying these two methodologies: MV uses covariances, and 
ERI minimization is particularly applicable in cases when covariances do not 
exist or cannot be estimated reliably. On the other hand, ERI minimiza¬ 
tion strongly relies on the estimation of the tail index a, which is known to 
become increasingly difficult for lighter tails - see, e.g., Embrechts et al. m- 


4.3 Backtesting with an alternative estimator for a 


To assess the suitability of the estimator we used for a, we repeated our back¬ 
testing experiments with another estimation approach. The Hill estimator 


in (3.1) uses the tail fraction size fc as a parameter. The foregoing results are 


based on a static 10% rule, i.e. k = 150. It is well known that the choice of 
the tail fraction size k can have a strong influence on the resulting estimates 
- see, e.g., Embrechts et al. na. Thus, as an alternative to the static 10% 
rule, we tried the recent adaptive approach by Nguyen and Samorodnitsky 
bh. which involves sequential statistical testing for polynomial tails. The 
results of this backtesting study are outlined below. 


Optimization over the entire set of stocks 


Figure 4.6 and Table 4.5 represent the basic setting without grouping the 


stocks according to the estimated tail index a. It is a bit surprising that the 
adaptive choice of the tail fraction size k does not improve the performance 
of the ERI based strategy. The annualized return is significantly lower than 
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Portfolio Optimization Backtest - Daily Rebalancing 



Figure 4.6 Alternative estimator a: 
portfolio optimization backtest in the ba¬ 
sic set-up (on all S&P 500 stocks) 



ERI 

MV 

EW 

S&P 500 

CR 

11.76% 

25.48% 

23.24% 

-19.38% 

AR 

2.81% 

5.81% 

5.34% 

-5.22% 

AS 

0.2360 

0.3469 

0.3229 

-0.0462 

AST 

0.0939 

0.1410 

0.1318 

-0.0187 

MD 

51.39% 

58.61% 

63.27% 

56.34% 

AC 

64.33 

127.22 

444 

N/A 

AT 

o'.ossT 

0.0272 

0.01 

n'/a 

PCA 

46.48% 

35.49% 

38.80% 

N/A 


Table 4.5 Alternative d: backtest statistics 
in the basic set-up. 


with the static 10% rule. The overall result clearly stays behind the MV 
and the EW benchmarks. The only aspect where ERI is still better is the 
maximal drawdown, but it cannot compensate for the lower overall return. 
The reason for this outcome is the lower value of the tail fraction size k that 
is selected by the adaptive approach. Typical values are about 25, and all 
values are lower than 150 that come from the static 10% rule. Thus the 
adaptive approach looks too far into the tail, where the scaling of excess 
probabilities may already be different from the scaling in the application 
range. 


Grouping the stocks according to the estimated a 


As next step, we grouped the stocks according to their estimates. On av¬ 
erage, the Nguyen-Samorodnitsky estimator gave higher values of a, i.e., it 
indicated lighter tails than the static 10% rule. Therefore we chose a dif¬ 
ferent grouping of the a values: a < 2.7, a G (2.7,4.5), and a > 4.5. The 
backtesting results are presented in Table 4.6[ 

In all three cases the annualized return of the ERI strategy is lower than 
that of the MV portfolio. Interestingly, the worst performance of the ERI 
based strategy occurs in the middle group, and not in the group with lightest 
tails. Possible explanations here may be the different composition of the three 
groups (heavy, moderate, or light tails) and also the different values of a used 
in the portfolio optimization algorithm. 

All in all we can conclude that adaptive (and thus fully automatized) 
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Portfolio Optimization Backtest - Daily Rebalancing 




ERI 

MV 

EW 

S&P 500 

CR 

11.33% 

18.75% 

24.76% 

-19.38% 

AR 

2.71% 

4.37% 

5.66% 

-5.22% 

AS 

0.2317 

0.3260 

0.3354 

-0.0462 

AST 

0.0958 

0.1351 

0.1362 

-0.0187 

MD 

47.76% 

45.79% 

61.77% 

56.34% 

AC 

9.02 

10.09 

107 

N/A 

AT 

0.0385 

0.0148 

0.01 

N/A 

PCA 

43.65% 

30.87% 

33.46% 

N/A 


(a) a < 2.7 


Portfolio Optimization Backtest - Daily Rebalancing 




ERI 

MV 

EW 

S&P 500 

CR 

17.67% 

31.97% 

24.67% 

-19.38% 

AR 

4.14% 

7.15% 

5.64% 

-5.22% 

AS 

0.3015 

0.4578 

0.3318 

-0.0462 

AST 

0.1202 

0.1852 

0.1365 

-0.0187 

MD 

46.26% 

49.93% 

63.26% 

56.34% 

AC 

48.16 

28.09 

252 

N/A 

AT 

0.0384 

0.0201 

0.01 

N/A 

PCA 

4^29% 

32.56% 

40.25% 

N/A 


(b) a G (2.7,4.5) 


Portfolio Optimization Backtest - Daily Rebalancing 



(c) 



ERI 

MV 

EW 

S&P 500 

CR 

15.23% 

17.02% 

15.12% 

-19.38% 

AR 

3.59% 

3.99% 

3.57% 

-5.22% 

AS 

0.2699 

0.2991 

0.2810 

-0.0462 

AST 

0.1155 

0.1233 

0.1143 

-0.0187 

MD 

46.43% 

48.29% 

65.23% 

56.34% 

AC 

6.73 

3.98 

85 

N/A 

AT 

0.0326 

0.0113 

o'.02 

N/A 

PCA 

58.04% 

4^34% 

4l54% 

N/A 


a > 4.5 


Table 4.6 Alternative a: backtest statistics on stocks 


23 


@2015, Elsevier. Licensed under the Creative Commons 
Attribution-NonCommercial-NoDerivatives 4-0 International 
http://Great ivecommons. org/ licenses/hy-nc-nd/ 4- 0/ 











































































The final publications is now available at doi:10.1016/j.jenipfin.2015.03.003 


choice of the tail fraction size k can be problematic in real applications. 
This can be explained by the tail orientation of the Nguyen-Samorodnitsky 
approach. Roughly speaking, it tests for polynomial tails and chooses the 
largest value of k for which the test is still positive. While this is perfectly 
reasonable for data from an exact MRV model, there are at least two reasons 
why this method can fail on real data. First, if the data fails to satisfy 
the MRV assumption far out in the tail, the subsequent testing for small 
values of k can be misleading. The second reason was already discussed in 
Section 2.2 If the polynomial scaling changes for different severities, then 


the scaling behaviour of the distribution in the application area can differ 
from what is suggested by the true, but too asymptotic tail index. Our 
backtesting results show that these issues are highly relevant in practice. 

4.4 Behaviour of portfolio characteristics over time 

We conclude our analysis by a comparison of the ways the competing port¬ 
folios behave over time. This allows for deeper insight and allows to discover 
some more points of difference. 


Concentration and portfolio composition 

We start with the development of the concentration coefficient (CC) intro¬ 


duced in (4.1). Its behaviour over time in the basic set-up (no grouping of 
stocks according to a) is shown in Figure 4.7 This graphic shows that the 


number of stocks in the MV portfolio is permanently about 10 times higher 
than in the ERI optimal portfolio. The CC oscillation pattern suggests that 
the ERI based portfolio is more volatile in the crisis and much less volatile 
in benign periods. 

This impression is confirmed by Figure |4.8 The dynamics of the MV 


Portfolio in Figure 4^ is similar, but the difference between the crisis and 
recovery period is somewhat weaker. All in all it seems that the minimum 
variance portfolio undergoes many small changes, whereas the changes in the 
ERI optimal portfolio are less but much stronger. 


Turnover 

The impression about stronger changes in the ERI portfolio accords with 
the findings on the average portfolio turnover in Sections 4.1| and 4.2 The 
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Figure 4.7 Concentration Coefficient in 
the backtesting experiment on all S&P 500 
stocks. Total set of stocks with 10% thresh¬ 
old alpha estimation. 





Figure 4.8 ERI optimal weights in back¬ 
testing on all S&P 500 stocks 



Figure 4.9 Weights of the minimum variance portfolio in backtesting on all S&P 500 
stocks 


development of the turnover coefficient over time is shown in Figure 4.10 


The larger the spikes in the turnover pattern, the greater the instantaneous 
portfolio shift. The difference between the ERI minimization and the MV 
portfolio in the crisis period is remarkable. The turnover pattern of the MV 
portfolio points to a lot of small portfolio changes that lead to permanent, 
but moderate trading activity. The pattern of the ERI based portfolio has a 
lower level of basic activity, but much greater spikes corresponding to large 
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portfolio shifts. Thus, if carried out immediately, the restructuring of the ERI 
optimal portfolio requires more liquidity in the market. This disadvantageous 
feature can be tempered by splitting the transactions and distributing them 
over time. The tradeoff between fast reaction to events in the market and 
liquidity constraints is an interesting topic for further research. 





Figure 4.10 Portfolio turnover 





Figure 4.11 Variance explained by the first 
PCA factor 


Diversification measured by PCA 


The development of the first PCA factor over time is shown in Figure 4.11 


The amount of portfolio variance that can be explained by the first PCA 
factor increases in the months after the default of Lehman Brothers to a new 
level. This shows that the recent financial crisis has changed the perception 
of dependence in the market and thus increased the dependence between the 
stocks. Ranging below 25% before the crisis, the first PCA factors of both 
strategies are typically above 35% afterwards. This chart indicates a change 
in the intrinsic market dynamics. The stronger co-movements of S&P 500 
stocks reflect the new perception of systemic risk. As a consequence, the 
diversification potential in the after-crisis period is lower than in the time 
before the crisis. 

Most of the time, the first PCA factor of the ERI optimal portfolio ranges 
somewhat below that of the Markowitz portfolio. Thus we can conclude that 
ERI optimization brings more diversity into the portfolio than the mean-vari¬ 
ance approach. To the use of PCA: There is one exception to this rule: in 
February 2009, the first PCA factor of the ERI optimal portfolio peaks out 
to 100%. It corresponds to a single day when the ERI strategy selects only 
one stock for the investment portfolio. On this remarkable day, the first PCA 
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factor is obviously identical with the investment portfolio. Recalculations let 
to slightly different weights but to almost identical portfolio returns. Results 
of this kind can be avoided in practice by appropriate bounds on portfolio 
restructuring. 

Tail index estimates 





Figure 4.12 Estimated tail index a for the 
radial part of S&P 500 stocks and three sub¬ 
groups 





Figure 4.13 Estimated tail index a for the 
S&P 500 stocks 


In addition to the backtesting studies where the tail index a is estimated 
for the radial part of the random vector X, we also estimated a for each 
stock separately. The estimated values of a for the radial parts are shown in 
Figure 4.12 and the development of a estimates for single stocks is shown in 
Figure 4.13 Beginning in summer 2008, there is a common downside trend 
for all stocks, i.e. all return tails become heavier in the crisis time. This 
trend stops in spring 2009. The missing recovery since then can be explained 
by the width of the estimation window. Based on the foregoing 1500 trading 
days, our estimators remain influenced by the crisis for 6 years. This effect 
is visible in both figures. In addition to that. Figure 4.12 shows that after 
the crisis the estimated values of a in all three sub-groups are very close to 
each other and even change their ordering compared to the pre-crisis period: 
the group with lowest a before crisis does not give the lowest a after the 
crisis. These effects may be explained by the strong influence of extremal 
events during the crisis on the estimates in the after-crisis period. As the 
historical observation window includes n = 1500 days, the crisis events do 
not disappear from this window until the end of the backtesting period. It 
seems that the estimated values of a tend to ignore the recovery of the stocks 
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in the after-crisis period. This may be one more explanation to the different 
performance of the ERI strategy in the different stock groups. This effect 
can be tempered by downweighting the observations in the historical window 
when they move away from the present time. The choice of this weighting 
rule goes beyond the scope of this paper and should be studied separately. 


Portfolio Optimization Backtest - Daily Rebalancing 




ERI 

MV 

EW 

S&P 500 

CR 

27.77% 

21.73% 

18.69% 

-9.38% 

AR 

6.31% 

5.03% 

4.37% 

-5.22% 

AS 

0.4864 

0.3216 

0.2949 

-0.0462 

AST 

0.1756 

0.1307 

0.1236 

-0.087 

MD 

46.08% 

57.92% 

62.85% 

56.34% 

AC 

8.71 

127.29 

444 

N/A 

AT 

0.0471 

0.0406 

0.03 

N/A 

PCA 

31.19% 

35.40% 

38.71% 

N/A 


all stocks 


Portfolio Optimization Backtest - Daily Rebalancing 

ERI MV EW SfcP 500~ 

CR 49.78% 20.41% 18.20% -19.38% 

AR 10.61% 4.74% 4.26% -5.22% 

AS 0.6491 0.3581 0.2904 -0.0462 

AST 0.2518 0.1318 0.1205 -0.0187 

MD 53.32% 47.01% 61.24% 56.34% 

AC 7.45 11.00 134 N/A 

AT 0.0324 0.0231 0.03 N/A 

PCA 34.94% 33.23% 35.08% N/A 

(b) a < 2.2 

Table 4.7 Weekly rebalancing: backtest statistics 

Another issue that may be relevant here is the sensitivity of tail estimators 
(including Hill’s a) to non-i.i.d. data and volatility clustering. Consistency 
and asymptotic normality results for tail estimators require that n —)■ cx), 
fc —)■ cxo and fc/n —)■ 0 where k = k{n) is the number of observations con¬ 
sidered extremal (we use n = 1500 and k = 150). Asymptotically, volatility 
clustering featured in many popular models (e.g. GARCH) increases the ef¬ 
fective sample size by the reciprocal value of the average cluster size. In 
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addition to these asymptotic results, finite sample behaviour of each par¬ 
ticular estimator can be relevant as well (cf. Chavez-Demoulin and Davison 
|1], Drees [TB], and references therein). 


4.5 Weekly rebalancing 


In this part of the study we switch from daily to weekly rebalancing. The 
calculation of portfolio weights is still based on daily data. This allows to use 
all observations in the historical window, and not only the weekly returns. 
In some sense, trading once a week reflects the delayed execution of large 
orders. To avoid moving the market, trading of high volumes is often split 
into parts and executed step by step. 

The results of this experiment are shown in Tables 4T and |4.8[ For 
simplicity, the numbers for S&P 500 are taken from the tables on daily rebal¬ 
ancing. The overall picture is very similar to the daily rebalancing set-up: 
ERI compares favourable to its peers in all backtesting runs except for the 
one experiment with light-tail stocks. The particularly high performance im¬ 
provement on stocks with heavy tails achieved with daily rebalancing can 
also be achieved with weekly rebalancing. 


5 Conclusions 

Our backtesting results suggest that the Extreme Risk Index (ERI) could be 
useful in practice. Comparing basic implementations of the ERI methodology 
with the minimum-variance (MV) portfolio and the equally weighted (EW) 
portfolio, we obtained promising results for stocks with heavy tails. Tailored 
to such assets, the ERI optimal portfolio not only outperforms MV and 
EW portfolios, but it also yields an annualized return of 11.5% over 4 years 
including the financial crisis of 2008. This advantage should outweigh the 
higher transaction costs caused by the ERI based approach. Thus, taking into 
account the special nature of diversification for heavy-tailed asset returns, the 
ERI strategy increases the reward for the corresponding risks. 

Our study also shows that the MV and EW approaches can catch up with 
ERI optimization in some cases, especially when applied to stocks with lighter 
tails. Therefore a combined algorithm switching between ERI and variance 
as risk measure (depending on the current volatility in the market) may be 
a good choice. First empirical studies confirm this. However, the results 
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Portfolio Optimization Backtest - Daily Rebalancing 




ERI 

MV 

EW 

S&P 500 

CR 

30.31% 

17.23% 

18.13% 

-19.38% 

AR 

6.83% 

4.05% 

4.25% 

-5.22% 

AS 

0.5348 

0.3064 

0.2921 

-0.04620 

AST 

0.1935 

0.1273 

0.1232 

-0.0187 

MD 

43.82% 

55.95% 

63.68% 

56.34% 

AC 

7.41 

1.00 

243 

N/A 

AT 

0.0337 

0.0000 

0.03 

N/A 

PCA 

32.74% 

100.00% 

40.16% 

N/A 


(a) a e (2.2,2.6) 


Portfolio Optimization Backtest - Daily Rebalancing 




ERI 

MV 

EW 

S&P 500 

CR 

0.04% 

10.72% 

18.90% 

-19.38% 

AR 

0.01% 

2.57% 

4.41% 

-5.22% 

AS 

0.0884 

0.2378 

0.3012 

0.0000 

AST 

0.0343 

0.0957 

0.1281 

0.0000 

MD 

4'i33% 

40.60% 

64.01% 

'o.oo% 

AC 

3.49 

4.48 

67 

N/A 

AT 

0.0189 

0.0162 

0.03 

N/A 

PCA 

54.81% 

52.27% 

47.44% 

N/A 


(b) a > 2.6 

Table 4.8 Weekly rebalancing: backtest statistics 


obtained so far are not very stable, and the choice of the switching strategy 
needs a deeper investigation. Other improvements of the ERI methodology 
may be achieved by downweighting the crisis events when they reach the far 
end of the historical observation window and by smoothing the pattern of 
trading activities. All these questions will be subject of further research. 
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